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ABSTRACT Vascular endothelial growth factor re- 
ceptor 3 (VEGFR-3) is required for cardiovascular 
development during embryogenesis. In adults, this re- 
ceptor is expressed in lymphatic endothelial cells, and 
mutant VEGFR3 alleles have been implicated in human 
hereditary lymphedema. To better understand the basis 
of its specific endothelial lineage-restricted expression, 
we have characterized the VEGFR3 gene and its regula- 
tory 5' flanking region. The human gene contains 31 
exons, of which exons 30a and 30b are alternatively 
spliced. The VEGFR3 proximal promoter is TATA-less 
and contains stretches of sequences homologous with 
the mouse Vegfr3 promoter region. In transfection 
experiments of cultured cells, the Vegfr3 promoter was 
shown to control endothelial cell-specific transcription 
of downstream reporter genes. This result was further 
confirmed in vivo; in a subset of transgenic mouse 
embryos, a 1.6 kb Vegfr3 promoter fragment directed 
weak lymphatic endothelial expression of the LacZ 
marker gene. This suggests that endothelial cell-specific 
elements occur in the proximal promoter, although 
further enhancer elements are probably located else- 
where. The sequence, organization, and variation in the 
VEGFR3 gene and its regulatory region provide impor- 
tant tools for the molecular genetic analysis of the 
lymphatic system and its disorders. — Iljin, K., Kark- 
kainen, M. J., Lawrence, E. C, Kimak, M. A., Uutela, 
M., Taipale, J., Pajusola, K., Alhonen, L., Halmekyto, 
M., Finegold, D. N., Ferrell, R. E., Alitalo, K VEGFR3 
gene structure, regulatory region, and sequence poly- 
morphisms. FASEB J. 15, 1028-1036 (2001) 
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Endothelial cells (ECs) lining the blood and lym- 
phatic vessels are dependent on receptor tyrosine ki- 
nase (RTK) -mediated signaling, which leads to growth 
or differentiation responses in the target cells. This 
signaling is required for normal development and 
maintenance of the vascular bed as well as for angio- 
genic responses in pathological conditions. Vascular 



endothelial cells express nonendothelial lineage- 
restricted RTKs such as receptors for fibroblast growth 
factors and epidermal growth factors, but also endothe- 
lial cell-restricted RTKs (reviewed in refs 1, 2). Two 
major families consist of vascular endothelial growth 
factor receptors (VEGFRs) and the angiopoietin recep- 
tors, also called Tie (tyrosine kinase with immunoglob- 
ulin and epidermal growth factor homology domains) 
receptors. VEGFRs are distinguished by an extracellular 
domain containing seven immunoglobulin (Ig) homol- 
ogy domains. The VEGFR family has three members: 
VEGFR-1 (also known as flt-1), VEGFR-2 (KDR/flk-1), 
and VEGFR-3 (FLT4) (3-7). In VEGFR-3, the fifth Ig 
homology domain of the extracellular part is proteo- 
lytically cleaved and the resulting polypeptides remain 
linked by two disulfide bonds (8). The ligands binding 
to VEGFRs belong to the VEGF family of growth factors, 
which has five cellular members: VEGF, placenta 
growth factor (P1GF), VEGF-B, VEGF-C, VEGF-D (9- 
13), and the recendy cloned viral VEGF homologue 
VEGF-E (14-16). VEGFs have specific sites of expres- 
sion and show distinct binding patterns to the different 
VEGFRs, reflecting the complex biological signaling 
involved in endothelial cell functions. 

Targeted deletion of genes encoding VEGFRs in 
mice leads to impaired development of the vasculature 
and embryonic death (17-20). Despite the importance 
of the VEGFRs in the development of blood vascula- 
ture, VEGFR-3 is unique among these receptors in that 
it is found almost exclusively in lymphatic endothelium 
in adults. The expression of VEGFR-3 starts at embry- 
onic day (E) 8.5 of mouse development in the angio- 
blasts of the head mesenchyme, the cardinal vein, and 
the allantois (21). During development, however, the 
strongest VEGFR-3 mRNA expression becomes gradu- 
ally restricted first to venous endothelia, and subse- 
quently to the lymphatic vessels (22). 

Whereas blood vessel formation has been extensively 
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studied, less is known about the regulation of lymphatic 
vessel growth. The development of lymphatic vessels 
starts at El 2 by centrifugal extension of vessels from 
venous sacs in the perimetanephric and jugular regions 
(20, 23). Thereafter, lymphatic vessels extend to most 
tissues growing around major arteries and become 
particularly abundant in the mesenterium and medias- 
tinum. VEGFR-3, the first lymphatic endothelial cell- 
specific RTK, was cloned as an orphan receptor from 
human erythroleukemia cell and placental cDNA li- 
braries (6, 7) and the VEGFR3 gene was located to the 
chromosomal segment 5q33-q35 (7, 24). Subsequently, 
VEGF-C and VEGF-D were found to bind to and to 
activate VEGFR-3 (12, 13, 25). VEGF-C mRNA is ex- 
pressed in close proximity to its receptor VEGFR-3 
during mouse development (21). Accordingly, VEGF-C 
is capable of inducing hyperplasia of lymphatic vessels 
in transgenic mice and in chick chorioallantoic mem- 
brane (26, 27). These results suggest that VEGFR-3 
plays a role in the generation of the lymphatic system. 

The regulatory regions of six genes sufficient to 
target expression of heterologous genes specifically to 
the endothelial cells in vivo have been described. These 
comprise the promoters/enhancers of the Tiel (28- 
30), Tie2 (30, 31), Vegfr2 (32), intercellular adhesion 
molecule 2 (ICAM2) (33, 34), von Willebrand factor 
(vWF) (35, 36), and vascular endothelial cadherin 
(VE-cadherin) (37) genes. These genes are expressed 
in the endothelium of blood vessels, whereas the 
VEGFR3 expression is relatively specific for lymphatic 
endothelial cells in adults except it is also expressed in 
some fenestrated blood endothelial cells and angio- 
genic blood vessels in tumors (38, 39). It is not clear 
which transcription factors regulate VEGFR3 expression 
or what molecular mechanisms are responsible for the 
restriction of its expression to the lymphatic endothe- 
lium during development. Here we have cloned and 
analyzed the VEGFR3 gene and characterized its struc- 
ture as well as the promoter region responsible for its 
endothelial cell-specific expression. 



MATERIALS AND METHODS 

Genomic organization, exon-intron sequences, and 
polymorphisms of the human VEGFR3 

To characterize the structure of the VEGFR3, a genomic 
cosmid clone (24) was subcloned as EcdRl fragments (10, 9.5, 
7.0, 4.3 and 3.5 kb) into the pGEM3 vector (Promega, 
Madison, Wis.). The genomic clones were ordered by using 
2.5, 1.2, and 0.6 kb EcoRl fragments of VEGFR3 cDNA (40) as 
probes in Southern blotting analysis. The cosmid subclones 
were sequenced using primers based on the published 
VEGFR3 cDNA (GenBank access, no. X68203 for VEGFR-3S 
and S66407 for the carboxyl-terminal tail of VEGFR-3L) (40). 
The sequence data from the cosmid clone was independently 
confirmed by using cDNA primers from the ends of exons to 
amplify segments of genomic DNA. We sequenced — 80% of 
the genomic DNA using PCR amplimers, cycle sequencing 
with dRhodamine ready reactions dye Terminator kit, and 
running them out on an AB1 Prism Model 377. The se- 



quences were analyzed further and aligned on Sequencer 3.1. 
(GeneCodes). Polymorphic variation in the VEGFR3 gene was 
identified by resequencing of a minimum of 50 chromosomes 
and allele frequencies were estimated from the sequence 
results. Recently, a genomic region of chromosome 5 contain- 
ing the VEGFR3 gene region in unordered fragments was 
published (AC022095), and a direct sequence comparison 
confirmed our results. 

Cell culture and transfection assays 

To analyze the function and activity of the Vegfr3 promoter, 
luciferase reporter gene constructs were used in transfection 
assays of cultured cells. A 3 kb Kpnl/Nod genomic fragment 
of the mouse genomic clone comprising the putative Vegfr3 
promoter was cloned into the pGL3 basic reporter vector. 
pGL3 basic and pGL3 control vectors contained the firefly 
(Photinus pyralis) luciferase gene without promoter and 
driven by the SV40 promoter/enhancer, respectively. A re- 
striction enzyme map of the mouse 3 kb Vegfr3 genomic 
fragment was made, deletions were introduced into the 
promoter fragment with the restriction enzymes indicated, 
and DNA fragments were inserted into the pGL3 basic vector. 
All constructs were confirmed by sequencing. 

Mouse lung (LE-II) and brain endothelial cells (BEND) 
were grown in minimal essential medium and NIH3T3 mu- 
rine fibroblasts in Dulbecco's modified minimum essential 
medium (DMEM) containing 10% fetal calf serum and 
antibiotics. Mouse keratinocytes (MK-2) were grown in low (2 
mM) CaCl 2 DMEM containing 10% fetal calf serum, antibi- 
otics, and epidermal growth factor (4 ng/ml). Cells were 
transfected using the calcium phosphate method. The mini- 
mal amount of plasmid registering full transcriptional activity 
was titrated prior to the experiments. In subsequent cases, 5 
u,g of the appropriate reporter construct was transfected 
along with 0.2 u.g of pRLTK (Promega) to control the 
variability in transfection efficiency. The pRL-TK vector con- 
tains the Herpes simplex virus thymidine kinase promoter 
region upstream of the ReniUa reniformis luciferase gene. Cell 
extracts were prepared 48 h after transfection in passive lysis 
buffer (Promega). Luciferase activity was measured using 
Digene DCR-1 luminometer and Promega Dual Luciferase 
Assay System. The ratio of firefly luciferase activity to renilla 
luciferase activity in each sample served as a measure of the 
normalized luciferase activity, which was divided by the activ- 
ity of the pGL3 control vector and expressed as relative 
luciferase activity. Each construct was transfected at least five 
times and data for each construct are presented as the 
mean ± se. Relative luciferase activities among constructs 
were compared by a factorial analysis of variance, followed by 
Student's t test. Statistical significance was accepted at 
P<0.05. 

Production and analysis of Vegfr3 promo ter-LacZ embryos 

A //indlll fragment of mouse Vegfr3 genomic sequence was 
cloned to the //indlll site of pSDKLacZpA. Transgenes con- 
taining either 3.6 kb //itwUII/AM, 1.6 kb Bsdi/Noil, or 0.8 kb 
Spd/Notl fragment of Vegfr3 genomic sequence, followed by 
the p-galactosidase gene and SV40pA, were excised from the 
plasmid by digestion with Safl, ApdL t and Bsal/Spd, respec- 
tively. Transgenic DNA fragments were microinjected into 
fertilized oocytes of the FVB/N strain of mice and the 
injected zygotes were transplanted into oviducts of pseudo- 
pregnant C57BL/6xDBA/2J hybrid mice. Embryos were dis- 
sected from the decidua on day E15.5 and stained for 
p-galactosidase expression as described previously (29). The 
embryos were genotyped by PCR analysis of the amnion DNA 
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using primers 5'-TCTGTCGATCCTT-3' and 5'-GCTGGAT- 
GCGGCGTGCGGT-3' specific for the Loci gene. 



RESULTS 

Organization and sequence variation of the human 
VEGFR3 gene 

We recently found mutations in the VEGFR3 gene in 
hereditary lymphedema (41, 42). Because VEGFR3 is 
one of a handful of receptor tyrosine kinase genes 
involved in human hereditary disease, we wanted to 
characterize this gene and its sequence variation. A 
cosmid clone containing the human VEGFR3 genomic 
sequences was analyzed by Southern blotting and hy- 
bridization with subcloned VEGFR3 cDNA fragments as 
probes, and suitable EcoRl fragments were subcloned 
into plasmid vectors. To characterize the exon-intron 
boundaries, these subclones were sequenced with 
primers based on the published human VEGFR3 cDNA 
sequence (40). The VEGFR3 gene was found to consist 
of 31 exons (Fig. LA), the largest of which (exon 13) is 
only 363 bp in length (Table 1). The 3' exons 30a and 
30b are alternatively spliced into the mRNA, producing 
polypeptide isoforms differing by 65 amino acid resi- 
dues (43). The nucleotide sequences of VEGFR3 exon- 
intron junctions were determined in their entirety; the 
lengths of most of the introns were estimated by using 
genomic PCR and further confirmed by comparing the 
lengths of the PCR products to the genomic sequence 
covering most of the VEGFR3 gene (Table 1). All 
exon-intron boundaries were found to conform to the 
5'-GT-AG-3' splice site rule. The first intron of the 
VEGFR3 was only partially sequenced. Its length is at 
least 13.5 kb whereas the rest of the coding region from 
the second exon to in exon 30b spans more than 29 kb 
of genomic DNA. 

The overall genomic organization of the VEGFR3 
gene is highly similar to the previously published 
organization of the Vegfrl and VEGFR2 genes (44, 
45). The coding capacity of the different exons is 
indicated in Fig. I A in relation to the domain 
structure of the mature VEGFR-3 receptor protein, 
which is presented in Fig. IB. Resequencing of 
VEGFR3 in a minimum of 25 individuals identified 22 
intragenic single nucleotide polymorphisms (SNPs) 
and an intronic (CA) n repeat element. Six SNPs 
occurred in exons, two synonymous substitutions 
(G507T and T3198C), and four predicted to cause 
amino acid substitutions in the coding sequence 
(N149D, T494A, P641S, and R1146H). The intra- 
genic polymorphic sites and allele frequency esti- 
mates are indicated in Table 2. 

Mouse and human VEGFR3 promoters contain two 
distinct regions of homology 

Sequences upstream of the mouse Vegfr3 gene were 
cloned from a mouse genomic DNA library. Sequenc- 



ing of clone F2N2.7, encompassing a 2.7 kb fragment 
upstream of the Notl site in the 5' leader sequence of 
the first coding exon, revealed that its 5'-end contains 
— 1.3 kb of repetitive sequences, followed by 1.4 kb of 
sequences not homologous to known genes (Fig. 1C). 
These sequences had two distinct regions of nonrepeti- 
tive sequence homology to the human promoter region 
that were separated by an Alu-like element in the 
human clone. These conserved regions, called homol- 
ogy regions 1 and 2 (HR1 and HR2 in Fig. 1Q, showed 
—70% identity between the mouse and human se- 
quences. HR1 contained conserved putative transcrip- 
tion factor binding sites, including a putative site for 
myocyte enhancer factor 2 (MEF-2) and several possi- 
ble binding motifs for transcription factors for mem- 
bers of the Ets family, marked PEA3 and E1A-F (Fig. 
1Q. HR2 was immediately upstream of the Notl site 
and contained multiple conserved binding sites for Spl 
like factors, as well as sequences that resemble the 
transcription start sites found in genes which do not 
contain a TATA box (Fig. 1C; 46). 
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Figure 1. Exon-intron structure of the human VEGFR3 and 
comparison of mouse and human VEGFR3 promoters. i4JThe 
genomic structure of human VEGFR3 consists of 31 exons 
(black boxes) of which 30a and 30b are alternatively spliced. 
The VEGFR3 exons encode seven Ig-homology domains (I- 
VII), a transmembrane region (TM), and tyrosine kinase 
domains (TK I and TK II), which are interrupted by the 
kinase insert sequence. B) Schematic structure of the 
VEGFR-3 receptor protein. C) Schematic structure of the 
upstream regions of the mouse and human VEGFR3 genes. 
Repetitive sequences are marked with green boxes, homology 
regions 1 and 2 (HRI and HR2) as orange and blue boxes, 
respectively. In the human promoter, these elements are 
separated by an Alu repeat. The sequences of HRI and HR2 
with conserved putative transcription factor binding sites are 
shown in the figure. 
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Table 1. Exon-intrxm boundaries of the hVEGFR3 n 



Exon 


Bp* 


Splicc acceptor 


5' of exon 


Exon Size 
<CDS; r bp) 


3' of exon 


1 


20-77 




ATGCAGCGG 


58 


TCCTGGACG 


2 


78-174 


caccacgctgaccctgtctcccaccccag 


GCCTGGTGA 


97 


CTCCTGCAG 


3 


175-419 


cccacctgaccactcctgtcctctgccag 


GGGACAGCA 


245 


TCGTGAGAG 


4 


420-532 


ttccccaacagct tgctccctctccatag 


ACTTTGAGC 


113 


CTGCGCTCG 


5 


535-695 


gggcaatgatggtggccttgtccctccag 


CAAAGCTCG 


163 


ACATCACAG 


6 


696-835 


cgtgggagtcactgtgtgcgtgtcaacag 


GCAACGAGC 


140 


GGGAAGCAG 


7 


836-1004 


tgacgcggggcttccctgggtgcgggcag 


GCAGAGCGG 


169 


TTGTGCATG 


8 


1005-1122 


gatttacagtgacttctcctgtccggcag 


AAAATCCCT 


118 


GTTCCAGTG 


9 


1123-1277 


atgggccctgacctcccttccctggccag 


GTACAAGGA 


155 


TGGTGAATG 


10 


1278-1440 


cagctgaccggccatctgtgtgcccacag 


TGCCCCCCC 


163 


GCGTAGTCT 


11 


1441-1567 


cctgcccttgcctcttctggtcctggcag 


CCGGCGGCG 


127 


AAGAATAAG 


12 


1568-1676 


aatggatccttgtgcctgtgctctcccag 


ACTGTGAGC 


109 


ATGTGACCA 


13 


1677-2039 


tgccacagggcttcttcccccctccacag 


CCATCCCCG 


363 


CGGTGCAGG 


14 


2040-2186 


cttagctccctctcgacccctgcccccag 


CCCTGGAAG 


147 


AAAAGTCTG 


15 


2187-2318 


gacccgcacctccaccccaccccctgcag 


GAGTCGACT 


132 


CCGTGGAAG 


16 


2319-2425 


cacacgccggcccctttctgcccacgcag 


GCTCCGAGG 


107 


ATGAGGAGG 


17 


2426-2561 


ccccactcctccctgtcccctccccgcag 


CCGGCCCAC 


136 


TGCACCTGG 


18 


2562-2666 


cacctgcaactggccccctgccccgacag 


GGAGAGTGC 


105 


TGCTGAAAG 


19 


2667-2780 


ggccggtgtgaggcccgtgtcccctccag 


AGGGCGCCA 


114 


AGCCGCAGG 


20 


2781-2869 


cgcacggccgccgcccggtcccgccgcag 


GCCCCCTCA 


89 


CCCTGCGCG 


21 


2870-3020 


gccctcgagccagcttcgtgcatccgcag 


GAGAAGTCT 


151 


ACCAAGAAG 


22 


3021-3115 


tcctgagcccacctggctccactgtgtag 


CTGAGGACC 


95 


TCCCGAAAG 


23 


3116-3238 


agcccccctgccgcctccccgcaccccag 


TGCATCCAC 


123 


AAGGGCAGT 


24 


3239-3350 


gcgggacaagcttccctctgtctccccag 


GCCCGGCTG 


112 


TCTCTCTGG 


25 


3351-3450 


ccccacccacctccccttcctgttgtcag 


GGGCCTCCC 


100 


TCCCGCCAT 


26 


3451-3556 


gcagtccagcagcccacgtgatcctgcag 


ACGCCGCAT 


106 


GGCCTGCAA 


27 


3557-3705 


ggcctggctgcctctcctgttcccgccag 


GAGGAAGAG 


149 


GGCCGCCAG 


28 


3706-3826 


cctcagcaccttctgatttctccccacag 


GTATTACAA 


121 


GGCTCTGTG 


29 


3827-3912 


ccacagcctggcttgtcctctccacacag 


GACAACCAG 


86 


CGGCTTCAG 


30a 


3913- 


gatcatgggacggggcccttccctcttag 


GTAG 


4 




30b 


3913- 


tactaacaccaccttccctgtcttggcag 


CTGTAAAGG 


199 


AGCTAC TAA 



Imron 

Splice donor size (kb) 

gtgagcgcggcgaacgggccacccgcccg >13.5 

gtactgggtccctctcccactggcaggcc 0.88 

gtgagacttggaggcgggccaggctggag 0.22 

gtacggccccaccccaaccccagcatccg 0.12 

gtaacagggctgtgccccgttcccagtaa 0.11 

gtgaggtcagcagcgctgccaggctgtcc 0.27 

gtatggcctgggaaaccagggcccttgtg 0.26 

gtaacagccctggctcctccccgaccccg 2.99 

gtaggtcagggacgggagagcgcacgggt 0.08 

gtgagtacagctccagcctcaggtcccct 1.81 

gtacaggccagccgcgctgctgagtgccc 1.09 

gtgagtgaccagaccagggcggggaggcc 0.83 

gtgaggctggccgcggggagggcggggac 0.29 

gtagggagggtggccctggcgaagggcag 0.10 . 

gtctggcctcagcctgcacccctcaactc 0.16 

gtgagtgtccctccccgctcctgatggag 0.30 

gtgaggccagcaccagcctgcccaaccca 0.40 

gtgtggggtcagcgggcggaaggggcagc 0.31 

gtacggagcgggcggcccgctgggacggc 0.14 

gtgagcgggcggcctgcggggcgccgcgg 0.10 

gtgagagcctggcctcttcccttttccta 1.74 

gtgagcttcccccgaaggcccttcagacg 0.40 

gtgagtgcaggccattgaggagagggaac 2.66 

gtgagtgcaggatggggtgccggtgggga 0.95 

gtgagcctccccatggccctgcaggtttt 0.40 

gtgagccccttccccaccctgttctacta 1.02 

gtcagctgtcctgcaggtccaggagtagg 1.30 

gtacttcacatgaagggtgggggctgcgc 0.85 

gtaagggcttcgtgagcctcctgcactgc 0.68 

NCS d 496bp 4.37 

NCS 1923 bp 



* Intron sequences are indicated by lowercase letters and coding sequences by capital letters. * bp: basepairs of cDNA (acc. no. X68203 Tor VEGFR-3S and 
S66407 for the COOH-terminal tail of VEGFR-3L), f CDS: coding sequence. d NCS: non-coding sequence before polyA signal. 



Analysis of the mouse Vegjr3 promoter activity in 
cultured cells 

To analyze whether the genomic sequences 5' of the 
VEGFR3 open reading frame confer endothelial cell- 
specific promoter activity, a 3 kb Kpnl-Notl fragment 
upstream of the first coding exon of the mouse Vegfr3 
gene was cloned into a luciferase reporter vector and 
transfected into cultured endothelial and nonendothe- 
lial cells. Cell extracts were assayed for luciferase activity 
(Fig. 2). As the lymphatic vessels originate from the 
vascular endothelium and no lymphatic endothelial 
cell lines are currently available, blood vascular endo- 
thelial cells were used to study the Vegfr3 promoter 
activity and cell type specificity in vitro. The Vegfr3 
expression in the endothelial cell lines used in the 
study was checked by Northern blot analysis. Mouse 
endothelial cells isolated from the lung (LE-II) did not 
express Vegfr3 whereas the brain endothelial cells did 
(data not shown). In LE-II and BEND cells, the 3 kb 
Vegfir3 promoter fragment showed 13% and 27% of the 
activity, respectively, of the SV40 promoter/enhancer 
used as a positive control. In NIH3T3 fibroblasts and 
MK-2 epithelial cells, the promoter showed much 
weaker activity (3% and 2%, respectively), indicating 
specificity for endothelial cells. However, the transcrip- 
tional activity of the 3 kb Vegfr3 promoter fragment in 
endothelial cells was weaker than the activity of a 1.1 kb 
Hindlll-Ncol Tie promoter fragment (47). 



Progressive 5'-deletions were made to the Vegfr3 
promoter to characterize sequences critical for the 
promoter activity. Figure 2 shows schematically the 
deleted Vegfr3 promoter fragments used in the trans- 
fection experiments. As only background promoter 
activity was detected in NIH3T3 and MK-2 cells, no 
conclusions about the effects of mutations to Vegfr3 
promoter activity were made based on the results 
obtained from these cell lines. The deletion of up- 
stream sequences leaving only a 1.6 kb Vegfr3 promoter 
fragment increased significandy (57%) the promoter 
activity in BEND cells, whereas in LE-II cells, the 
promoter activity was slighdy reduced (Fig. 2). Further 
deletions leaving only 858 or 819 bp of the sequences 
upstream of the Notl site further increased the pro- 
moter activity in both LE-II and BEND cells. However, 
upon further deletion, luciferase activity driven by the 
453 bp fragment was reduced by 40% in LE-II cells and 
50% in BEND cells. These results suggest that the 
KpnVArvll Vegfr3 promoter fragment upstream of the 
HR1 contains DNA sequences inhibitory for transcrip- 
tion whereas the HR1 itself contains transcriptional 
enhancer elements. 

Activity of the Vegfr3 regulatory sequences in 
transgenic mouse embryos 

To analyze whether the Vegfr3 regulatory region iden- 
tified in vitro is functional and specific to lymphatic as 
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Table 2. Polymorphic variations identified in the human VEGFR3 and the conditions for genotyping these polymorphisms 



Location 




Detection 






Annealing 


MgCl 2 




Nucleotide 


method 


Forward primer 


Reverse primer 


Tm (°C) 


(mM) 


Allele frequencies 


IniS-40 bp 3' cxS 


C toT 


Sma\ 


ccagctcctacgtgttcg 


ggcaacagctggatgtca 

m 


56 


1.0 


C « 0.75 T - 0.25 


IniS-55 bp 10 cx3 


CtoT 


Apal 


* 


* 


m 


C o 0.88 T = 0.12 


IniS-185 bp 3' 














cx3 


G to A 


DNA sequencing 




» 




m 


G - 0.97 A = 0.03 


IntS-200 bp lo 














ex3 


C to C 


A/uI 


m 


m 


• 


» 


G = 0.75 C - 0.25 


Ex4 (N149D) 


A445G 


Ssll/Acd 


» 


m 






A =* 0.99 G «= 0.01 


Ex4 (silent) 


G507T 


Hhal 




* 


• 




G = 0.63 T - 0.37 


Int7-65 bp 3' to 
















cx7 


G to A 


Stf 


ctgtgagggcgtgggagt 


gtcctttgagccactcga 


54 


1.5 


G o 0.48 A = 0.52 


Ini7-215 bp 3' to 
















ex7 


CtoT 


DNA sequencing 


• 


» 


» 


» 


C = 0.78 T - 0.22 


Int9-23 bp 3' to 
















ex9 


CtoT 


Wtal 


ccagcacaggcacctacacc 


agtgccactggat get gaga 


58 


1.5 


C = 0.90T« 0.10 


Ex 11 (T494A) 


A1480G 


Hhal 


ggtgactggatatgacaag 


gcctacagactgcaggaa 


58 


1.5 


A = 0.90 G - 0.10 


ExlS (P641S) 


C1921T 


DNA sequencing 


tcaccatcgaatccaagc 


agttctgcgtgagccgag 


56 


1.0 


C = 0.99 T - 0.01 


Intl5-82 bp 3' to 
















exl5 


C to G 


DNA sequencing 


ccttgggcaagtcgtggc 


gagagagactccatcagg 


54 


1.5 


C = 0.92 G = 0.08 
<CA) 9 = 0.79, 


Intl5-118 bp 3' 




Polyacrylamide 










(CA) l0 = 0.04, 


to exl5 


CA repeat 




m 




m 


m 


(CA),, - 0.17 


Intl 7-238 bp 3' 














to cxl7 


A to C 


Hlial 


catcaagacgggctacct 


ccgctgaccccacacctt 


56 


1.0 


A = 0.72 C - 0.28 


Intl 7-287 bp 3' 
















to cxl7 


CtoT 


DNA sequencing 


■ 


0 


w 


• 


C = 0.95 T - 0.05 


Intl 7-333 bp 3' 
















to cxl7 


CtoT 


Bspmei 


• 


m 


w 




C - 0.88 T = 0.12 


Int22-169 bp 3' 
















to ex22 


G toT 


Styl 


gaagctgaggacctgtgg 


gcggacgtagtcagggtc 


56 


1.0 


G = 0.74 T = 0.26 


Ex23 (silent) 


T3198C 


DNA sequencing 


gagttgacctcccaaggt 


tctcctggacaggcagtc 


56 


1.5 


T = 0.99 C = 0.01 


Int24-134 bp 3' 
















to ex24 


TtoC 


DNA sequencing 


gcagagtgacgtgtggtc 


gccctcatccttgtgccg 


58 


1.5 


T - 0.86 C = 0.14 


In t24-673 bp 3 
















to ex24 


A to G 


DNA sequencing 






m 


i» 


A — 0.58 G = 0.42 


Ex26 (R1146H) 


G3437A 


S/aNI 


cggcacaaggatgagggc 


aatgcaggtctcgccttg 


58 


1.5 


G = 0.84 A = 0.16 


In t28-586 bp 3' 
















to ex28 


TtoC 


DNA sequencing 


gctgagacccgtggttcc 


agctcaccttgaacgcgc 


54 


1.0 


T - 0.73 C = 0.27 


Int29-616 bp 3' 
















to ex29 


G to A 


DNA sequencing 


m 


* 






G - 0.65 A = 0.35 



opposed to blood vascular endothelium in vivo, we 
generated transgenic mouse embryos having the 3.6 kb 
HindUhNod promoter fragment upstream of the LacZ 
reporter gene. The embryos were stained and analyzed 
for ($-galactosidase expression at E15.5. At this develop- 
mental stage, the lymphatic vessels are developing, 
while the skin is still permeable to the 0-galactosidase 
staining reagents. Of 1 1 embryos positive for the 3.6 kb 
Vegfr3 promoter/ LacZ DNA, endothelial-specific trans- 



gene expression was observed in only one (Fig. 3A). 
Three other embryos with staining in a subset of 
endothelial cells also showed some ectopic expression. 
Two DNA-positive embryos showed only ectopic stain- 
ing and five did not stain at all. As the 3.6 kb Vegfr3 
promoter fragment could occasionally target expres- 
sion of the LacZ reporter gene to the endothelium of 
mouse embryos, we tested whether the 1 .6 kb Bsal-Notl 
promoter fragment could drive the reporter gene ex- 



Figure 2. Activities of mouse Vegfr3 promoter 
fragments used in transfected endothelial 
and nonendothelial cells. Restriction en- 
zyme cleavage sites used in making the de- 
leted constructs are shown along the Kpnl- 
Afefl promoter fragment. HR1 and HR2 are 
marked as orange and blue boxes, respec- 
tively. The lengths of the promoter frag- 
ments are indicated as base pairs above each 
construct. The promoter activities were ana- 
lyzed in LE-II and BEND endothelial cell 
lines. NIH3T3 fibroblasts and MK-2 epithe- 
lial cells were used as a nonendothelial cell 
controls. Promoter activity is presented as 
percentage of the pGL3 control, and the 
Tie-1 promoter activity is indicated for 
comparison. 
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Figure 3. Expression of the mouse Vegfr3 promoter constructs in 15.5 day p.c. mouse embryos. Comparison of the staining 
patterns obtained with the 3.6 kb (A) and 1.6 kb Vegfr3 promoter/ LacL (B) transgenes with that of the Vegfr3/LacZ knock-in 
embryos (Q. Note that the staining pattern is the same in all panels, although weaker in the case of the transgenes. The staining 
of a branching network is particularly evident in the neck region (arrows) and in the lateral regions of abdomen (arrowheads). 
Analysis of the skin sections from the corresponding embryos (D-F) indicates that the P-galactosidase expression is restricted to 
the lymphatic vessels (arrows). No 0-galactosidase activity was found in the adjacent blood vessels (arrowheads). 



pression specifically to the lymphatic endothelium. 
Based on whole mount analysis, of the 22 embryos 
positive for the 1.6 kb Vegfr3 promoter/LacZ transgene 
DNA endothelial-specific transgene expression was ob- 
served in two, while seven embryos had transgene 
expression also in nonendothelial cells. One of the two 
embryos having endothelial cell-specific expression is 
shown in Fig. 3B. As can be seen from the Fig. 3A, B, the 
P-galactosidase marker was expressed in a loose branch- 
ing network pattern in the skin in both cases. This 
pattern was particularly evident in the dorsal parts of 
the developing forelimbs and in the neck region. The 
morphology of these structures suggested that they 
represent developing lymphatic vessels that form in the 
skin at this stage of mouse development by an 
(lymph) angiogenic mechanism. A similar kind of stain- 
ing pattern, although much stronger and more exten- 
sive, was seen in the Vegfr3 LacL knock-in embryos (Fig. 
3C). No blue staining could be found in similarly 
treated transgene-negative embryos. 

The vascular specificity of the staining patterns was 
confirmed by analyzing tissue sections. Skin sections of 
the 3.6 kb Vegjr3 promoter/ LacL transgenic embryo 
revealed that although the 3-galactosidase staining was 
very faint, it was clearly restricted to the developing 
lymphatic endothelium (Fig. 3D). Analysis of the sec- 
tions from the 1.6 kb Vegfr3 promoter /LacZ transgenic 
embryos confirmed that this fragment was sufficient to 
drive P-galactosidase expression largely to the lym- 
phatic vessels (Fig. 3£)> although some veins were also 
weakly positive. However, the other transgenic embryo 
for the 1.6 kb Vegfr3/LacZ construct, which showed 



staining only in the lymphatics, had some weak staining 
in the chondrocytes. Analysis of tissue sections of the 
Vegfr3/LacZ knock-in embryo confirmed the endoge- 
nous pattern of expression and weaker activity of the 
promoter fragments (Fig. 3F). Staining of blood capil- 
laries or arteries was not observed in either case. 

As the 1.6 kb promoter fragment was able to drive the 
gene expression to the lymphatic endothelium in some 
of the embryos, we continued to study the reporter 
gene expression targeted by the 0.8 kb Spd-Notl Vegfr3 
promoter fragment, which showed stronger promoter 
activity in cultured cells. In the transgenic analysis of 
the 0.8 kb Vegfr3 promoter/LacZ construct, 20 embryos 
screened were transgene positive; of these, four em- 
bryos had staining in endothelial cells. Three also had 
ectopic staining in other cell types. However, even in 
the embryos showing endothelial cell-specific expres- 
sion of the transgene, the staining pattern was not 
restricted to the developing lymphatic endothelium. 
Strong promoter activity was found also in the develop- 
ing capillary-sized vessels, e.g., in the head (data not 
shown). Table 3 summarizes the results from the Vegfr3 
promoter/ LacL transgenic studies. 



DISCUSSION 

In this study, we have characterized the structure of the 
human VEGFR3 gene and both mouse and human 
VEGFR3 promoters. The coding sequence of the 
VEGFR3 gene is organized into 31 exons that closely 



VECFR3 STRUCTURE AND REGULATION 



1033 



Tabic 3. Summary of the in vivo activity of VegfrS promoter 
fragments in transgenic mouse embryos at El 5.5 



Promoter fragment 


TG a 


ES* 


ES/ET' 


ET" 


NO' 


3.6 kb Hindm/Notl 


11 


1 


3 


2 


5 


1.6 kb Bsal/Notl 


22 


2' 


7 


0 


13 


0.8 kb Spd/Notl 


20 


1 


3 


1 


15 



• TG: number of transgenic embryos. 6 ES: embryos showing 
LacZ staining only in endothelial cells. r ES/ET: number of 
embryos showing ectopic and endothelial staining. rf ET: number 
of embryos showing only ectopic staining. 'NO: number of 
embryos with no staining. Aveak staining was observed in a 
subset in chondrocytes in one of the embryos. 

correspond to the genomic organization of the mouse 
Vegfrl and human VEGFR2 genes (44, 45). In our 
analyses, we also identified many polymorphisms of the 
VEGFR3 sequence. The data concerning the exon- 
intron boundaries and polymorphisms is important 
and useful from the clinical point of view as we have 
recently demonstrated that VEGFR3 missense mutations 
are a cause of familial early onset lymphedema (41, 42). 
Due to the complexity and heterogeneity of this dis- 
ease, it is likely that several other VEGFR3 mutations 
may also be found in different lymphedema families. 
Using the information about the VEGFR3 genomic 
structure, intragenic polymorphisms and linkage stud- 
ies in families with lymphedema or other inherited 
abnormalities of the lymphatic system should allow 
rapid screening for mutations in VEGF/?3-linked fami- 
lies. Since the endothelial cell-associated genes are 
candidate loci for vascular dysmorphogenesis syn- 
dromes and hereditary angiogenic disorders, analysis of 
VEGFR3 mutations from patients suffering from lym- 
phatic disorders might reveal the molecular basis of 
such phenotypes. 

We have also isolated and partially characterized the 
genomic regions upstream of the mouse and human 
VEGFR3 genes. The upstream sequences showed -70% 
identity in two distinct regions, which represented the 
putative 5' enhancer/promoter and 3' minimal pro- 
moter elements, respectively. No classical TATA or 
CAAT boxes were found in VEGFR3. The 3 kb fragment 
of the 5 '-flanking sequence of the mouse Vegfr3 dem- 
onstrated activity in cultured endothelial cells but only 
minimal activity in keratinocytes and fibroblasts. Dele- 
tion studies of the Vegfr3 promoter indicated that both 
homology elements were needed for activity. The 5' 
homology element had conserved putative binding sites 
for transcription factors conforming to the MEF-2, 
NF-1, AP-2, GATA, and Ets families. Two Ets family 
transcription factor binding sites, PEA3 and E1A-F, 
were conserved in the mouse and human VEGFR3 
promoters. Sites for Ets-related transcription factors are 
also found in all other known promoters showing 
endothelial specificity in vivo, such as in the promoters 
of genes encoding Tie-1, Tie-2, VE-cadherin, ICAM-2, 
and vWF (29-31, 33, 35-37). The 3' homology region 
contains multiple conserved binding sites for Spl like 
factors as well as sequences that resemble the transcrip- 



tion start sites found in genes that do not contain a 
TATA box. The transcription factors controlling the 
VEGFR3 gene expression remain unknown and further 
studies are needed to determine which consensus bind- 
ing sites are critical for the VEGFR3 promoter activity. 

We also studied the ability of three Vegfr3 promoter 
fragments to target gene expression in the developing 
lymphatic endothelium in vivo. Our results from trans- 
genic embryos at E15.5 indicate that the 3.6 kb and 1.6 
kb Vegfr3 promoter fragments contained specific ele- 
ments sufficient to direct gene expression to the lym- 
phatic endothelium. However, as only few of the trans- 
genics showed a similar staining pattern to that of the 
Vegfr3/LacZ knock-in embryos, the gene expression 
driven by these promoter fragments is apparently very 
sensitive to interference by the transgene integration 
site and further enhancer elements needed for repro- 
ducible endothelium-specific reporter gene expression 
are likely to be missing from these promoter fragments. 
Many tissue-specific gene regulatory elements are lo- 
cated within the first two introns, although such ele- 
ments occasionally can be found at a great distance, 
even in the 3' parts of genes. Vegfrl and Tie2 are 
examples of endothelial cell-specific genes whose activ- 
ity is partly regulated by enhancers located in their first 
introns (31, 32). Therefore, we are currendy analyzing 
the large first introns of the mouse and human Vegfr3 
genes to locate putative enhancer elements. We have 
sequenced the first intron of the mouse Vegfr3 consist- 
ing of 15 kb of genomic DNA. When compared with the 
human VEGFR3 first intron, which has recendy become 
available in the GenBank, the sequences were found to 
be —70% homologous. Several short regions showing 
high homology were found, but so far no additional 
enhancers could be identified in transfection studies 
using various DNA fragments from the mouse intron 
(unpublished data of the authors). 

Lymphatic vessels are quite different from arteriae, 
veins, and capillaries in structure. The lymphatics are 
characterized by an extremely permeable, thin endo- 
thelial lining devoid of a basal lamina. In addition, the 
small lymphatic vessels typically lack supporting cells, 
such as pericytes or smooth muscle cells (21, 48). These 
differences indicate that the lymphatic endothelial cells 
represent a differentiated form of endothelial cells. 
Differentiation of endothelia to arterial, venous, and 
lymphatic lineages has not been understood at a mo- 
lecular level. Recent evidence suggests that arteries and 
veins are genetically determined at the earliest stages of 
vasculogenesis (49, 50). It is unclear at precisely what 
stage lymphatic vessels are determined. One possibility 
is that VEGF-C and VEGF-D expressed by smooth 
muscle cells of arteries and veins would be involved in 
directing the formation of lymphatic vessels around 
them (38). The isolation of the VEGFR3 promoter now 
makes it possible to identify the transcription factors 
that contribute to lymphatic development using tech- 
niques of mouse molecular genetics. The relevant 
transcription factor binding sites in addition to the 
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coding sequence may also be targets of mutations in 
human lymphedema. 

Previous studies have aimed to identify promoter 
elements necessary for uniform vascular endothelial 
cell-specific gene expression. However, a major obsta- 
cle in gene therapy directed to endothelial cells is the 
fact that leakage of catheters and blood flow rapidly 
washes away delivery vehicles, resulting in undesired 
gene expression in distant organs, particularly in the 
liver (51). Therefore, promoters that restrict expres- 
sion of transgenes to particular subsets of endothelial 
cells may be more desirable in such settings. In hered- 
itary lymphedema with reduced VEGFR3 signaling in 
heterozygous affected individuals (41, 42), genes that 
would induce VEGFR-3 signaling specifically in the 
lymphatic endothelium might improve the growth and 
function of lymphatic vessels without side effects in 
other tissues. Another potential use for the Vegfr3 
promoter would be in 'partial ' rescue experiments, as 
null mutations in several endothelial-specific genes of 
mice are known to lead to embryonic lethality at an 
early stage. Although the transcriptional activity of the 
Vegfr3 promoter is relatively weak, identification of 
additional endogenous enhancers or the use of en- 
hancers increasing promoter activity without losing its 
cellular specificity might increase the efficacy of Vegfr3 
promoter in the potential applications described above 
(52). (fj] 
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